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Amendments to the Claims: 

This listing of claims will replace all prior versions, and listings of claims in the 
application. Applicant has submitted a new complete claim set showing any marked up claims 
with insertions indicated by underlining and deletions indicated by strikeouts and/or double 
bracketing. 

Listing of Claims: 

1. (Currently amended) A method for dynamically updating a collection of information in a 
database including a plurality of pre-existing clusters of information for publication comprising: 

a) extracting from received information a set of characterizing features which 
characterize the received information; 

b) updating the collection of information by g rouping together the received information 
with one or more pre-existing clusters in the collection that have having common characterizing 
features in common with the received information into a number of clusters ; and 

c) using the information obtained in the - grouping stop to publish ing at least a portion of 
the updated collection of information contained in a cluster based on a customer request for 
information. 

2. (Currently amended) The method of claim 1 wherein the received information comprises 
a combination of one or more of text data, image data, ander video data. 

3. (Original) The method of claim 1 wherein said received information comprises multiple 
features of a given type and wherein the multiple features are ranked in importance as the 
features are extracted. 
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4. (Original) The method of claim 3 wherein a cluster includes a summarization of cluster 
features and additionally comprising comparing the features that summarize newly received 
information with features summarized in a cluster by taking an inner product of the features 
common to the newly received information and the features that summarize said cluster and 
combining the newly received information with a cluster if the inner product exceeds a threshold. 

5. (Currently amended) The method of claim 34- wherein a number o f thc top K features of 
rank of a newly received item of information are compared with a corresponding number oft he 
top K features of a cluster to determine if said information is added to a cluster. 

6. (Currently amended) The method of claim 5 wherein each feature has a relevancy factor 
by which the feature is scaled and additionally determining if a cluster and the newly received 
information have at least a number of h common features having non-zero relevancy factors 
before adding the received information into a cluster. 

7. (Original) The method of claim 1 additionally comprising grouping together clusters 
having a common characteristics to produce a neighborhood of clusters which are all published 
in response to a customer request. 

8. (Original) The method of claim 7 wherein the received information is a text containing 
document and a relevancy of a neighborhood is used to determine whether to publish documents 
in a neighborhood to a customer. 

9. (Original) The method of claim 8 wherein the relevancy varies depending on how long 
the document has been assigned to the neighborhood. 
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10. (Original) The method of claim 8 wherein the relevancy varies with information 
contained in the request for information. 

11. (Currently amended) The method of claim 7 wherein an item of received information 
is may be grouped into more than one cluster but published with only one neighborhood. 

12. (Original) The method of claim 11 additionally comprising maintaining a null 
neighborhood and adding received information to the null neighborhood when said information 
is initially received. 

13. (Original) The method of claim 11 additionally comprising maintaining a null 
neighborhood and adding received information to the null neighborhood when contents of a 
neighborhood change due to a reconstituting of said neighborhood. 

14. (Original) The method of claim 1 1 additionally comprising maintaining a null 
neighborhood and adding received information to the null neighborhood when a neighborhood to 
which the received information becomes non-relevant. 

15. (Withdrawn) A process for evaluating documents comprising: 

a) evaluating multiple documents containing text data for subsequent publication by 
extracting K tokens having a highest token relevance factor based on the frequency of token 
occurrence within the document; 

b) grouping together documents having a commonality in said text data that is greater 
than a threshold to provide a number of document clusters of said documents; said grouping 
performed by: 
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i) comparing the K tokens from a candidate document with a document cluster 
characterizing set of tokens; 

ii) adding a candidate document to a document cluster if the comparison indicates 
a sufficient degree of similarity between the candidate document and said document 
cluster; and 

iii) updating a document cluster summarization that takes into account the added 
candidate document; and 

c) publishing documents assigned to a specified document cluster or document clusters 
based upon a request. 

16. (Withdrawn) The process of claim 15 wherein documents have document categories and 
evaluating the token relevance factor comprises determining a category frequency of tokens 
within a document category and assigning a relevance factor to said token based on said category 
frequency. 

17. (Withdrawn) The process of claim 16 wherein tokens are assigned a relevance factor 
based on a position of a token within the document. 

18. (Withdrawn) The process of claim 15 wherein if said candidate document is not 
sufficiently similar to a cluster it forms the basis of its own new cluster. 

19. (Withdrawn) The process of claim 15 wherein the token relevance factor is determined 
from a relation exp(-a*p.sub.Oi)*N.sub.i*R.sub.i, where a is the decay rate of token relevance as 
a function of the distance from the beginning of the text of a document D, p.sub.Oi is the position 
at which token i first appears in the text, N.sub.i is the number of occurrences of token i and 
R.sub.i is the log of the inverse document frequency of token i in the category of documents to 
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which the document D belongs. 

20. (Withdrawn) The process of claim 15 wherein clusters of documents are clustered to 
form neighborhoods of documents to which documents are assigned. 

21. (Withdrawn) The process of claim 16 wherein the neighborhoods are assigned a 
neighborhood relevancy factor which varies with time, said neighborhood relevancy factor used 
to determine to whom a neighborhood is published. 

22. (Withdrawn) The process of claim 21 wherein the neighborhood relevancy number also 
varies with a document relevancy factor of documents that make up the neighborhood. 

23. (Withdrawn) The process of claim 22 wherein the document relevancy factor depends on 
the quality of the source of the document. 

24. (Withdrawn) The process of claim 22 wherein the document relevancy factor depends on 
the location of the source and the location of a requestor. 

25. (Withdrawn) The process of claim 21 wherein the neighborhood relevancy factor varies 
with the category of documents assigned to said neighborhood. 

26. (Withdrawn) A system for evaluating documents comprising: 

a) a preprocessor for receiving text documents from one of a plurality of document 
sources and evaluating text data contained in each received document for determining suitability 
of the document for subsequent publication based on a request; said preprocessor grouping 
together documents having a commonality greater than a threshold to provide a number of 
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clusters of said documents; and 

b) a web server having access to the cluster data from the preprocessor for making 
available to a requester documents contained within a cluster based a comparison between a 
request from the requester and a summarization of text contained within documents of a 
specified cluster or clusters. 

27. (Withdrawn) The system of claim 26 wherein the preprocessor groups together clusters 
into a neighborhood of clusters and further wherein documents within a neighborhood are made 
available to a requester. 

28. (Withdrawn) The system of claim 26 wherein a cluster of documents is removed from 
publication by the web server based on a cluster relevancy of the entire cluster. 

29. (Currently amended) A computer readable storage medium containing instructions for 
executing a method for dynamically updating a collection of information in a database including 
a plurality of pre-existing clusters of information for publication^ the method comprising 
instructions for : 

a) extracting from received information a set of characterizing features which 
characterize the received information; 

b) updating the collection of information by grouping together the received information 
with one or more pre-existing clusters in the collection that have having common characterizing 
features in common with the received information into a number of clusters ; and 

c) using the information obtained in the grouping step to publish ing at least a portion of 
the updated collection of information contained in a cluster based on a customer request for 
information. 
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30. (Currently amended) The computer readable storage medium of claim 29 wherein the 
received information comprises a combination of one or more of text data, image data, ande r- 
video data. 

3 1 . (Currently amended) The computer readable storage medium of claim 29 wherein said 
received information comprises multiple features of a given type and wherein the multiple 
features are ranked in importance as the features are extracted. 

32. (Currently amended) The computer readable storage medium of claim 31 wherein a 
cluster includes a summarization of cluster features and additionally comprising comparing the 
features that summarize newly received information with features summarized in a cluster by 
taking an inner product of the features common to the newly received information and the 
features that summarize said cluster and combining the newly received information with a cluster 
if the inner product exceeds a threshold. 

33. (Currently amended) The computer readable storage medium of claim 3129 wherein a 
number o f tho top K features of rank of a newly received item of information are compared with a 
corresponding number o f the top K features of a cluster to determine if said information is added 
to a cluster. 

34. (Currently amended) The computer readable storage medium of claim 33 wherein each 
feature has a relevancy factor by which the feature is scaled and additionally determining if a 
cluster and the newly received information have at least a number of fc common features having 
non-zero relevancy factors before adding the received information into a cluster. 

35. (Currently amended) The computer readable storage medium of claim 29 comprising an 
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additional step of grouping together clusters having a common characteristics to produce a 
neighborhood of clusters which are all published in response to a customer request. 

36. (Currently amended) The computer readable storage medium of claim 35 wherein a 
relevancy of a neighborhood is used to determine whether to publish documents in a 
neighborhood to a customer. 

37. (Currently amended) The computer readable storage medium of claim 36 wherein the 
relevancy varies with how long the document has been in the neighborhood. 

38. (Currently amended) The computer readable storage medium of claim 36 wherein the 
relevancy varies with information contained in a request for information. 

39. (Currently amended) The computer readable storage medium of claim 35 additionally 
comprising maintaining a null neighborhood and adding received information to the null 
neighborhood when said information is initially received. 

40. (Currently amended) The computer readable storage medium of claim 35 additionally 
comprising maintaining a null neighborhood and adding received information to the null 
neighborhood when contents of a neighborhood change due to a reconstituting of said 
neighborhood. 

41. (Currently amended) The computer readable storage medium of claim 35 additionally 
comprising maintaining a null neighborhood and adding received information to the null 
neighborhood when a neighborhood to which the received information becomes non-relevant. 
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42. (New) The method of claim 1 further comprising updating the collection of information 
by forming a new cluster containing the received information if there are no pre-existing clusters 
in the collection that have characterizing features in common with the received information. 

43. (New) The computer readable storage medium of claim 29 further comprising updating 
the collection of information by forming a new cluster containing the received information if 
there are no pre-existing clusters in the collection that have characterizing features in common 
with the received information. 
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